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Abstract 

We characterize the languages in the individual levels of the quantifier 
alternation hierarchy of first-order logic with two variables by identities. 
This implies decidability of the individual levels. More generally we show 
that the two-sided semidirect product of a decidable variety with the va- 
riety J is decidable. 

1 Introduction 

It has been known for some time (Kamp [6., Immerman and Kozen [5]) that 
every first-order sentence over the base < defining properties of finite words 
is equivalent to one containing only three variables. The fragment F0 2 [<] of 
sentences that use only two variables, has been the object of intensive study; 
Tesson and Therien [I^] give a broad-ranging survey of the many places in which 
the class of languages definable in this logic arises. Weis and Immerman [18] 
initiated the study of the hierarchy within FO 2 [<] based on alternation of quan- 
tifiers. They showed, using model-theoretic techniques, that the hierarchy is 
infinite, but finite for each fixed alphabet. 

In [T5] , the second author provided an algebraic characterization of the levels 
of the hierarchy, showing that they correspond to the levels of weakly iterated 
two-sided semidirect products of the pseudovariety J of finite ^-trivial monoids. 
This still left open the problem of decidability of the hierarchy: effectively deter- 
mining from a description of a regular language the lowest level of the hierarchy 
to which the language belongs. This problem was apparently solved in Almeida- 
Weil (2], from which explicit identities for the iterated product varieties can 
be extracted. However, an error in that paper called the correctness of these 
results into question. Here we show that the given identities do indeed character- 
ize these pseudovarieties. In particular, since it is possible to verify effectively 
whether a given finite monoid satisfies one of these identities, we obtain an 
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effective procedure for exactly determining the alternation depth of a regular 
language definable in two- variable logic. 

We show more generally that the two-sided semidirect product of a pseu- 
dovariety with J as the right-hand factor preserve decidability. That is, if we 
have an effective procedure for determining if a given finite monoid belongs to 
a variety V, then we have such a procedure for V ** J. 

At several junctures, our proof could have been shortened by appealing to 
known results about the algebra of finite categories and the topological theory 
of profinite monoids, which are the principal tools of |2.. For example, Theorem 
13.11 is really just the bonded component theorem of Tilson [T7] coupled with 
Simon's Theorem |13) on j7-trivial monoids. Lemma 14.21 closely mirrors the 
work of Almeida on the structure of the free profinite j7-trivial monoid [1 . In 
order to keep our argument accessible and self-contained, we have chosen to 
steer clear of these quite technical results. We do discuss finite categories, but 
only at the most elementary level. Avoiding profinite techniques forces us to 
give explicit size bounds, but these are of independent interest in decidability 
questions. 

We give the necessary preliminaries from algebra in Section [2] Section [3] is 
devoted to our fundamental theorem, a category-based characterization of two- 
sided semidirect products with J as the right-hand factor. We apply this result 
in Section 2] to obtain explicit identities for the levels of the hierarchy, thus 
solving the decidability problem. We use these identities in Section [5] to give a 
new proof of the result of Weis and Immerman that the hierarchy collapses for 
each fixed input alphabet. Section [6] proves the general decidability-preserving 
result for block products with J. 

After we circulated an early draft of this paper, we became aware of a number 
of related results. Kufleitner and Weil [5], building on earlier work of theirs [5], 
independently established the decidability of the levels of the alternation hierar- 
chy, using an entirely different algebraic characterization. A proof that V ** J 
is decidable if V appears in the unpublished Ph.D. thesis of Steinberg [T4"] . 

2 Preliminaries 

While the principal application of our results is in finite model theory, this paper 
contains no formal logic per se and is entirely algebraic in content. The reader 
should consult [TS] and [TH] for the definition of F0 2 [<] and the alternation 
hierarchy within it. For our purposes here, they are to be viewed simply as the 
language classes corresponding to certain varieties of finite monoids, as discussed 
below. 

2.1 Finite monoids and regular languages 

See the book by Pin [TU] for a detailed treatment of the matters discussed in 
this subsection and the next; here we give a brief informal review. 
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A monoid is a set M together with an associative operation for which there 
is an identity element 1 G M. If A is a finite alphabet, then A* is a monoid with 
concatenation of words as the multiplication. A* is the free monoid on A: this 
means that every map a : A — > M, where M is a monoid, extends in a unique 
fashion to a homomorphism from A* into M. 

Apart from free monoids, all the monoids we consider in this paper are 
finite. If M is a finite monoid, then for every element m G M there is a unique 
e G {m k : k > 1} that is idempotent, i.e., e 2 = e. We denote this element m u . 

If M, N are monoids then we say M divides N, and write M -< N, if M is a 
homomorphic image of a submonoid of N. 

We are interested in monoids because of their connection with automata and 
regular languages: A congruence on A* is an equivalence relation ~ on A* such 
that u\ ~ U2, t>i ~ t>2, implies ui^i ~ ^2^2- The classes of ~ then form a monoid 
M = A* I ~, and the map u n> [u]^ sending each word to its congruence class 
is a homomorphism. If L C i*, then =l, the syntactic congruence of L, is the 
coarsest congruence for which L is a union of congruence classes. The quotient 
monoid A* / =l is called the syntactic monoid of L and is denoted M(L). 

We say that a monoid M recognizes a language L C ^4* if there is a ho- 
momorphism a : A* — > M and a subset X of M such that cti _1 (X) = L. The 
following proposition gives the fundamental properties linking automata to finite 
monoids. 

Proposition 2.1. A language L C A* is a regular if and only if M(L) is finite. 
A monoid M recognizes L if and only if M(L) -< M. 

2.2 Varieties and identities 

A collection V of finite monoids closed under finite direct products and division 
is called a pseudovariety of finite monoids. (The prefix 'pseudo' is there because 
of the restriction to finite products, as the standard use of 'variety' in universal 
algebra does not carry this restriction.) 

Given a pseudovariety V, we consider for each finite alphabet A the set A*V 
of regular languages L C A* such that M(L) e V. We call V the variety of 
languages corresponding to the pseudovariety V. The correspondence VhjV 
is one-to-one, a consequence of the fact that every pseudovariety is generated 
by the syntactic monoids it contains. We are interested in this correspondence 
because of its connection with decidability problems for classes of regular lan- 
guages: To test whether a given language L belongs to A*V, we compute its 
syntactic monoid M(L) and test whether M(L) G V. Since the multiplication 
table of the syntactic monoid can be effectively computed from any automa- 
ton representation of L, decidability for the classes A*V reduces to determining 
whether a given finite monoid belongs to V. 

Let S be the countable alphabet X — {x\, X2, ■ ■ ■}■ A term over S is built 
from the letters by concatenation and application of a unary operation v . 
For example, (xiX2) u 'xi is a term. We will interpret these terms in finite monoids 
in the obvious way, by considering a valuation ip : S — > M and extending it to 
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terms by giving concatenation and the uj operator their usual meaning in M. 
For this reason, we do not distinguish between (uv)w and u(vw), where u,v 
and w are themselves terms, nor between terms u u and {u u y , as these will be 
equivalent under every valuation. 

An identity is a formal equation u = v, where u and v are terms. We say that 
a monoid M satisfies the identity, and write M \= (u = v), if it and v are equal 
under every valuation into M. The family of all finite monoids satisfying a given 
set of identities is a pseudovariety, and we say that the pseudovariety is defined by 
the set of identities. We must stress that the identities we consider here are very 
special instances of a much more general class of pseudoidentities. Under this 
broader definition, every pseudovariety is defined by a set of pseudoidentities. 
See, for instance, Almeida pQ. If a pseudovariety V is defined by a finite set of 
identities of the form we described, then membership of a given finite monoid M 
in V is decidable, since we only need substitute elements of V for the variables 
in the identities in every way possible, and check that equality holds in each 
case. 

We consider four particular pseudovarieties that will be of importance in this 
paper. (In presenting identities we will relax the formal requirement that all 
terms are over the alphabet {x\, X2, ■ ■ ■}, and use a larger assortment of letters 
for the variables.) 

Ap The pseudovariety Ap consists of the aperiodic finite monoids, those that 
contain no nontrivial groups. It is defined by the identity x u = xx u . If A 
is a finite alphabet and L C A* is a regular language, then M(L) £ Ap 
if and only if L is definable by a first-order sentence over < . In other 
words, the first-order definable languages form the variety of languages 
corresponding to Ap. 

DA The pseudovariety DA is defined by the pair of identities 

{xyz) u y{xyz) u = (xyz) u . 

There are many equivalent characterizations of this pseudovariety in terms 
of other identities, the ideal structure of the monoids, and logic. For us 
the most important ones are these: First, DA is also defined by the 
identities 

{x y y{yxy{x y y = { xy y,x" = xx«. 

Second, let e £ M be idempotent, and let M e be the submonoid of M 
generated by the elements m £ M for which e £ MmM. Then M £ DA 
if and only if e = eM e e for all idempotents e of M. Finally, if L C A* 
is a regular language, then M(L) £ DA if and only if L is definable in 
F0 2 [<]. In other words, the two- variable definable languages form the 
variety of languages corresponding to DA. 

J The pseudovariety J consists of finite monoids that satisfy the pair of 
identities 

( xy y = (yxy,x" = xx". 
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This is equivalent to the identities 

{xy) u x = y(xy) u = (xy) u . 

Alternatively, J consists of finite monoids M such that for all s,t £ M, 
MsM — MtM implies s — t. Such monoids are said to be J~-tiivial. 

A theorem due to I. Simon [T3] describes the regular languages whose 
syntactic monoids are in J. Let w 6 A*. We say that v = a% ■ ■ ■ a^, where 
each a, £ A, is a subword of w if 

w = w aiwi ■ ■ ■ a k w k 

for some Wi £ A. We define an equivalence relation on A* that identi- 
fies two words if and only if they contain the same subwords of length no 
more than k. (In particular, Wi ~i u>2 if and only if w\ and u>2 contain 
the same set of letters.) Simon's theorem is: 

Theorem 2.2. Let (f> : A* — > M a homomorphism onto a finite monoid. 
Then the following are equivalent: 

• M e J. 

• There exists k > 1 such that if w ~fe w' , then (f>(w) = <f>(w'). (In 
particular, M is a quotient of A* / .) 

It is easy to show that the second condition implies the first; the deep 
content of the theorem is the converse implication. The theorem can also 
be formulated in first-order logic: The variety of languages correspond- 
ing to J consists of languages definable by boolean combinations of Si 
sentences over <. 

Ji The pseudovariety Ji consists of all idcmpotcnt and commutative monoids; 
i.e., those finite monoids that satisfy the identities x 1 = x,xy = yx. A 
language L C A* is in the variety of languages corresponding to Ji if and 
only if it is a union of ^i-classcs. It is well known, and easy to show, that 

Ji C J C DA C Ap, 

and all the inclusions are proper. 

2.3 Two-sided Semidirect Products 

In this section we describe an operation on pseudovarieties of finite monoids, 
the two-sided semidirect product. This was given its formal description by 
Rhodes and Tilson [llj , but it has precursors in automata theory in the work of 
Schiitzenberger on sequential bimachines [H] , Krohn, Mateosian and Rhodes [7] , 
and Eilenberg on triple products [1] . Traditionally, one begins with a two-sided 
semidirect product operation on monoids, and then uses this to define the cor- 
responding operation on pseudovarieties. Here we find it simpler to define the 
operation on varieties directly. 



5 



Let A be a finite alphabet, and ijj : A* — ► N a homomorphism into a finite 
monoid. Let S = N x A x N, which we treat as a new finite alphabet. We dchnc 
a length-preserving transduction (not a homomorphism) : A* — ¥ E* by 

r^(ai ■ ■ ■ a n ) = ax ■ ■ ■ a n , where 

<Ji = (V>(ai • • • ai_i), en, ip(a i+ i ■ ■ ■ a k )) £ £. 

(If i = 1, we interpret the right-hand side as (1, 01,^(02 ■ ■ 'flfc)), and similarly 
if i = ri.) 

Let V and W be pseudovarieties of finite monoids. Let M be a finite monoid, 
and let <f> : A* — > M be a surjective homomorphism. We say that Me V«W 
if and only if there exist homomorphisms 

ip : A* -> N £ W, 

ft : (iV x A x N)* -> X S V, 

such that factors through (ft o r^,ip) — in other words, for all v, w £ A*, if 
ip(v) = ip(w) and ft(r^,(u)) = h(r^(w)), then 0(v) = 4>(w). It is not difficult 
to check that this is independent of the alphabet A and the homomorphism <p, 
and is thus determined entirely by M, and that furthermore V ** W forms a 
pseudovariety of finite monoids. We will treat this as the definition of V ** W, 
but it is also straightforward to verify that this coincides with the pseudovariety 
generated by two-sided semidirect products K ** N, where K £ V and N £ W. 

We define a sequence {Vi}i>i of pseudovarieties by setting Vi = J, and, for 
i > 1, Vj + i = Vj J. The main result of [15] is that DA is the union of the 
pseudovarieties , and that the variety of languages corresponding to Vj is the 
i th level of the alternation hierarchy within F0 2 [<]. 

2.4 Finite categories 

We give a brief account of the tools from the algebraic theory of finite categories 
needed to prove our main results. The original papers of Tilson [17] and Rhodes 
and Tilson [11] give a complete and careful exposition of the general theory. 

The categories studied in category theory are typically big categories, in 
which the object class consists of something like all topological spaces, and the 
arrows are all continuous functions. The work of Tilson [17] showed the utility 
of studying very small categories in which the object set, as well as each set of 
arrows between two objects, is finite. 

A category C consists of a set of objects obj(C), a set of arrows hom(yl,i?) 
from A to B for all A, B £ obj(C), and a associative partial binary operations 
o : hom(A B) x hom(B, C) -t hom(A, C) for all A,B,C £ obj(C) called compo- 
sition, such that there is an identity in hom(A, B) for all A,Be obj(C). 

In this view, a finite monoid is simply a category with a single object, and 
a finite category is consequently a generalized finite monoid. 

Let A be a finite alphabet, M and N finite monoids with homomorphisms 



where <j> maps onto M. We will define a finite category, which we call the kernel 
category ker(^> o 4>~ l ). The objects of ker(i/> o are pairs (ni,n 2 ) G TV x 7V0 
The arrows are represented by triples 

(m,n 2 ) A KjHj), 

where u d A* , n\ — n\ ■ ip(u) and tp(u) ' n 2 — n i- Whenever we have a pair of 
consecutive arrows 

/ \ u / / i \ ( i i \ v i ii ii\ 
(ni,n 2 ) ->■ (n. l7 n 2 ), (n l5 n 2 ) -4 [n 1 ,n 2 ), 

then we can define the product arrow 

If this were all there were to arrows in the kernel category, we would in general 
have an infinite set of arrows between two objects. However, we identify two 
coterminal arrows 

(ni,n 2 ) ► («i,« 2 ) 

if for all v, w G A* with ip(y) = n-y, ip{w) = n' 2 , 

(j)(vuw) = 4>{vu'w). 

It is easy to check that this identification is compatible with the product on 
consecutive arrows, so the true arrows of ker(?/> o (f>~ 1 ) are equivalence classes 
modulo this identification. In particular, the finiteness of M and N implies that 
there are only finitely many distinct arrows 

If (711,712) — (n^n^), then any pair of arrows from (711,712) to itself are 
consecutive, and thus the set of all such arrows at (711,712) is a finite monoid, 
which we denote M nii „ 2 . This is a base monoid. Base monoids, then, are just 
built from words u satisfying 711 ■ tp(u) = n±, and ip(u) ■ 712 = 7i 2 , an d collapsing 
modulo the equivalence relation identifying arrows. 

The following Lemma concerning the structure of the base monoids will be 
quite useful. 

Lemma 2.3. Let A be a finite alphabet: M,N,N' finite monoids, and consider 
homomorphisms 

m^La*^n^\ n', 

where <j> maps onto M. Then every base monoid of ker(tp o </> _1 ) divides some 
base monoid of]seT((ip'ip) o (f)^ 1 ). 

1 The odd notation for the kernel category is used to maintain consistency with the tradi- 
tional setting for these finite categories, ip o <j>~ 1 is a relational morphism from M to N, and 
Tilson defines these categories for arbitrary relational morphisms, not just those derived from 
morphisms of the free monoid. 
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Proof. Let n l7 n 2 G N. We denote by Mi the base monoid at (m , ri2) in ker(-0 o </> 1 ) , 
and by M 2 the base monoid at (n[,n 2 ) = ip(n 2 )) in ker((^>'V>) o^ 1 ). 

Set 

U = {u G A* : m ■ ip(u) = Tlx, n 2 — ip(u) ■ n 2 }, 

U' = {ueA* : n[ ■ <ip'ip(u) = n\,n' 2 = ip'ip{u) ■ n' 2 }. 

U and U' are submonoids of A*, and U C [/'. Mi and M 2 are the quotients 
of U and £7' by the congruences identifying equivalent arrows in the respective 
categories. Let u,u' G U represent equivalent arrows of M 2 , and suppose v, w G 
A* are such that ip{v) = rii, ip{w) = n 2 . Then ip'ip(v) = n'i, ip'ipiw) — n' 2 , so 
by equivalence in M 2 we have ^(vuw) = <p(vu'w). But this means that u and 
u' represent equivalent arrows in Mi, so Mi is a quotient of the image of U in 
M 2 . Thus Mi <M 2 . □ 

It is worth keeping in mind the somewhat counterintuitive message of this 
lemma: The category ker(i/) o </> -1 ) is bigger (it has more objects) than ker((^'-0) o (p^ 1 ) 
but its base monoids are smaller. 

The reason for the construction of the kernel category is its relation to two- 
sided semidirect products. Roughly speaking, M G V ** W if and only if there 
exists ip : A* — > N G W such that the category ker(-0 ° is 'globally in 
V. We will not define this precisely, but instead prove the consequence that if 
M G V ** W, then ker(i/> o (f)" 1 ) satisfies a weaker condition of being 'locally in 
V.' 

Proposition 2.4. Let <f> : A* —> M be a homomorphism mapping onto M. If 
M G V ** W, then there is a homomorphism tp : A* — > N G W such that each 
base monoid o/ker(?/; o <f)~ l ) is in V. 

Proof. Since M G V ** W, there exist homomorphisms ip : A* — > N G W 
and h : S* — > K G V, where S = N x A x N satisfying the conditions in the 
definition of the product variety. Let n\,n 2 G N, and let M' denote the base 
monoid at (m, n 2 ). As in the proof of Lemma 12.31 we set 

U = {u G A* : Tlx ■ ip(u) = ni, n 2 — ip{u) ■ n 2 }. 

If a = (n, a, n') G E, we set 

cr™ 2 = (n, a,n'n 2 ) G E, 

ni a = (nxn, a, n') G S, 

"V" 2 = (nxn,a,n'n 2 ) G S. 

We extend these actions to £*: If z = ax ■ ■ -o- n G E*, then z" 2 = cri™ 2 • • • ct„™ 2 , 
and similarly for the other two operations. For u G U we set a(u) = h( ni (u)™ 2 ) . 
We then have a(uu') = a(u)a(u') for u, u' G U. (Observe that this property does 
not hold for arbitrary u,u' G A*, but depends on the fact that words in U sta- 
bilize nx on the right and n 2 on the left.) Thus a is a homomorphism from U 
into K. 



8 



Suppose now that a(u) = a(u'). Let v,w £ A* with ip( v ) = n i, ^{w) = n%- 
Then 

h{T^(vuw)) = h(T^(v) n2 ) ■ a(u) ■ h( ni T^{w)), 

and similarly 

/i(^H) = M^»" 2 ) ■ a(u') • ft( ni T^(«;)). 

So h(T^(vuw)) = h(T^(vu'w)), and ip(vuw) — nxn^ — ip(vu'w), so 4>(vuw) = 
4>(vu'w). Consequently w, v! represent the same element of the base monoid at 
(ni, n2). Thus the map taking u £ U to the corresponding element of M' factors 
through the homomorphism a, so M' -< if, giving M' S V. □ 

3 A local-global theorem for categories 



In general, the converse of Proposition 12.41 is false. This section is devoted to 
establishing an important instance in which it is true, namely when W = J. 

Theorem 3.1. Let A be a finite alphabet, M and N finite monoids with N £ J 
and homomorphisms 

M^i'A N. 

Suppose V is a pseudovariety of finite monoids with Ji C V. If every base 
monoid o/ker(-0 o 4>~ l ) is in V, then M £ V ** J. 

Proof. It follows from Theorem 12.21 that for some k > 0, i/j factors through 
the homomorphism A* — > A* / ~& identifying two words that have the same 
subwords up to length k. By Lemma 12.31 we may assume that tp is this ho- 
momorphism, and that N = A* / ~£ . In particular, if w £ A*, then we can 
represent ip(w) as the set of subwords of w of length no more than k. (When we 
need to emphasize the dependence of ip on the chosen subword length, we will 
write it as ipk-) 

The set V(N x N) of subsets of N x N forms an idempotent and commutative 
monoid with union as the operation, and hence belongs to V. Let £ = N x A x N 
and let hu be the homomorphism 

hu : S* -> V(N x N) 

by a = (P, a, S)^{(P,S)} 
for each a £ S. Given P, S £ N, define a homomorphism 

h P ,s : £* -> Mrs 

by mapping (P', a, 5") £ E to the arrow class of (P, a, S) if P = P', 5 = <S", and 
to 1 G Afp.s otherwise. Finally, set M' to be the direct product 

M' = V(N x N) x J| Mp, s , 

(P,S)eNxN 
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and set 

h = hjj x J [ hp. s . 

(P,S)£NxN 

By our hypothesis M' e V. 

Let w, w 1 £ A*, with vp(w) = tp(w') and h{r^(w)) = h(T^(w')). We will show 
4>{w) = 4>(w'), which gives the result. 

We will look at the paths through ker(ip o cf)^ 1 ) traced out by w and w' .. 
Since ip(w) = ip(w'), the two paths are cotcrminal, beginning at the object 
(l,ip(w)) and ending at (ip(w), 1). If the i th letter of w is a i: then the i th arrow 
on this path is the class of 

{Pi-l, Si-i) — ^ (Pi, Si), 

where Pj is ip(u) for the prefix a\ ■ ■ ■ a,j-i of j — 1 of w, and likewise Sj = tp(v) 
for the suffix v = aj + i ■ ■ ■ a\ w \. If 

(P,S)^(P',S') 

is on the path traced by w, then we have P C P' and S' C S. Either P = P' and 
S = S', in which case this arrow belongs to one of the base monoids, or at least 
one of the inclusions is proper. Since hu{r{w)) — hu(r(w')), the same pairs 
(P,S), (P',S') must occur in the path traced by w' . Because of the inclusions, 
they must occur in the same relative order in this path, with (P, S) preceding 
(P', S'). They also must be adjacent in this path, since if there were a third pair 
(P", S") between them, we would have 

PCP"c P', S' c 5"' c S, 

so this new pair would have to occur in the original path traced by w, strictly 
between (P,S) and (P',5"). Finally, the letter a labeling the arrow joining 
these two objects in the respective paths is completely determined by (P, S) 
and (P',S"). This is because at least one of the two inclusions P C P' and 
5" C S is proper. Assume without loss of generality that the first of these is a 
proper inclusion. Then P' contains a word that is not in P, and the last letter 
of this word is a. 

Thus our two paths are depicted by the diagram below: 



Uq Ml Mr 
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The paths traverse exactly the same sequence of distinct objects 

(i, v-H) = (Jtf, s'j), (p{', s'{), . . . , (p;, 5;') = (tp( w ), 1). 

The arrow joining (Pj_i,Sj_i) and (P",S") in both these paths is the same 
letter a". For j = 0, . . . , r each path contains a loop at (P'J , S'j) labeled by a 
factor uj of w in one path, and a factor u'- in the other path. We have 

= uoa' x u\ ■ ■ ■ a"u r , 

w' = u' Q aiu[ ■ ■ ■ a"u' r . 
Let wo = w, and for j = 0, . . . , r, let 

Wj+i = u' a" ■ ■ ■ u'ja" +l Uj+i ■ ■ ■ a r u r , 

so that u>,+i = w' . In other words, we transform w into w' one step at a time, 
changing each Uj in succession to u'-. We claim that at each step, 4>(wj) — 
<j)(wj + i), so that we will get </>(w) = <j>(w'), as required. Let (P, S) = (P",Sj). 
Then, by the definition of the homomorphisms /ip,s, we have hp^s(r,p(w)) = 
hp,s(Tip(uj)), h Pt s{T^(w')) = h^sir^iu'j)), and thus h Pt s{T^{uj)) = ftp,s(T^(it^)) 
This means that 

Uj ,11 ■ 

(P,S)^(P,S) 

are equivalent arrows. Thus 

<f>(wj) = (f){u' a'l ■ ■ ■ Uj^a'jUja"^ ■■•u r ) 

= 4>{u' Q a'{ ■ ■ ■ u' j _ 1 a'ju' j a' : l +1 ■ ■ ■ u r ) = <p(w j+1 ). 

□ 

We remark that the hypothesis Ji C V is actually not necessary. If V is 
a pseudovariety of monoids that does not contain Ji, then every member of V 
is a group, and it is know that the converse of Proposition 12.41 holds when V 
contains only groups; this follows from results in [17J . We do not require this 
fact in our main application to the alternation hierarchy. 

4 Effective characterization of levels of the al- 
ternation hierarchy 

We now define a sequence of identities that will allow us to characterize the 
varieties V„. We set 

Ui = (xiX2) U ,Vi = (x 2 xi) w , 

and for n > 1, 

U n +1 = {Xf- X 2n X2n+\Y U n {x 2n +2X\ • • • X 2 nT , 
V n+ i = (Xx ■ ■ ■ X 2 nX2n+lTv n (x 2 n+2Xl • ■ ■ X 2n ) U '. 
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Theorem 4.1. Let n > 1. M £ V„ i/ and only if M |= (u n = v n ), and 
M \= (x" = xx u ). 

As we remarked above, when a pseudovariety Vis denned by a finite set of 
identities of this type, one can decide membership in V. Since the levels of the 
alternation hierarchy in F0 2 [<] are the varieties of languages corresponding to 
the Vi, the alternation depth of a language in F0 2 [<] is effectively computable. 

Proof. The 'only if part (the identities hold in V„) is proved in 115] , so we will 
just give the proof of the 'if part (sufficiency of the identities). 

We prove the theorem by induction on n. It is well known that the identities 
u\ = Vi, x u — xx u characterize Vi = J. 

So we assume n > 1 and suppose that M \= (u n = v n ). We let 4>, ip be as in 
the previous section, so that <f> is any morphism mapping onto M £ V n , and if) 
depends on the choice of a subword length K. We will show that if K is chosen to 
be a large enough value, then each base monoid Mp,s of the category ker^o^ -1 ) 
satisfies the identity it„_i = u n _i. By the inductive hypothesis, this implies that 
each Mp.s belongs to V„_i, and thus by Theorem l3.11 M £ V n _i ** J = V„. 

We let x±, . . . x 2 ( n -i) be any elements of Mp y $- Thus, each Xi is represented 
by a triple 

(RS)^(RS), 

where w, € A*, P ■ ip{w l ) = P, ip{w t ) ■ S = S. 

We construct words W n -i 1 W! n _ 1 £ A* by replacing each xi in u n -i (respec- 
tively Vn-i) by Wi. We will think of uj in these strings as representing a finite 
exponent N such that x N = x N+1 for all x £ M, and hence for all x £ Mp t s- 
Thus if n > 2, 

W n -i = (wiW 2 ■ ■ ■ W 2n -3) N W n - 2 (w2n-2Wi ■ ■ -W 2 n~i) N , 
W n _ x = (wiW 2 ■ ■ ■ W 2n _ 3 ) Ar W^_ 2 (w 2 n-2U'l ' ■ ■ W2n~4) N - 

In the special case n — 2, we have W\ = (w 1 w 2 ) JV , W[ = (w 2 Wi) N . 

Let z, y £ A* with ip(z) — P, ip(y) — S. If w £ A* , we denote by a(w) the set 
of letters occurring in A* . We also denote by B the set a{W n ~i) — a(Wl l __ 1 ). 

Lemma 4.2. If K > \M\ ■ (\A\ 2 + \A\)/2, then z has a suffix z' with a factor- 
ization 

z' = ZlZ 2 ■ ■ ■ Z|M| 

where 

B C a(zi) = a(z 2 ) = ■■■ = a(z\ M \), 
and, likewise, y has a prefix y' with a factorization 

y' = 2/12/2 • ■ • v\m\ 

where 

B C a(yi) = a(y 2 ) = ■■■ = a(y\ M \). 
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Assuming the lemma, we will now complete the proof of Theorem 14. II Since 
M \= (u n — v n ), we obtain M |= (xy)^ (yx)^ (xy)^ = (xy)" by setting x\ = x, 
x 2 = y, and x k = 1 for k > 2. Thus M G DA. 

We can write z — z" V ', where z' = zi...2|Afi has a factorization as in 
Lemma 14.21 By the standard pumping argument, it follows that there are 
indices i < j such that <f>{z\ . . . Zj_i)0(z, • • • Zj) = <f>(z\ . . . Zj_i), and thus 



{z\ . . . Zi-i)4>(zi ■ ■ ■ ZjY = 4>(zi . . . Zi-i). 



If we now set 



e = tf>(zi ■ ■ ■ ZjY 
S2n-i = e • (j>{zj+i ■ ■ ■ Z\ M \), and 
Si = 4>{wi) for i < In — 1, 
we obtain, from the identity e • M e • e = e, 

000 = 



z x . 


■ ■ Z»- 


l) ' 0Oj'+l ■ ' ' 


z |Af|) 


Z X . 


■ • Z%- 


,i)e ■ e<t>(zj + i 


•••^|M|) 


Zi . 


..Zi- 


-i) e • (0(^+1 


' ' •^|Af|)(si • ••S2n-l) W 


Zi . 


■ ■ Z l - 


. x )e • (p(z j+ i ■ ■ 


•■^|M|)(S1 •■•S2n-l) W 


*')• 


•(*!■ 


■•«2»-ir. 





The third equality above holds because by Lemma 14.21 z,-_|_i ■ • • Zj contains all 
the letters that occur in the z k and the Wk, and hence all the values we inserted 
between occurrences of e belong to M e . 

Similarly, using the part of Lemma 14.21 concerning the prefix of y, we find a 
value S2n such that 4>(y') = (s2„si • • ■ S2n-2) LJ 4>{y')- Since M |= (u n = v„) we 
obtain 

<j>(zW n - lV ) = <f>(z")<t>(z , )<t>(W n - 1 )<f>(y , )<t>(y") 

= 0(z")0(z')(si • • • s 2n - 1 ) u 4>(W n - 1 )(s 2n s 1 ■ ■ • S2 „_ 2 r0( y ')0(y") 

= j>[z")<t>{z'){s x ■ ■ ■ a an -l) W 0«-l)(«2n*l • • • ^n-aJW W) 

= HzW^y). 

But this means that Mp^s \= (u n —i = u n _i), as we required. □ 
We now turn to the proof of Lemma [ 



Proof of Lemma \4-2\ By symmetry, we only need to treat the part concerning 
the suffix of z. Recall that ip(z) = iPk(z) is the set of subwords of length no 
more than K in z, and that tp(zb) = ip(z) for all teB. 

We will show that if B C a(z) and ipr(zb) = ipr(z) for all b E B, where 

T = \M\ ■ (k 2 + k)/2, 
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and k — \a(z)\, then z contains a suffix with the required properties. This gives 
the lemma, because ipxizb) = iPk(z) implies ipT(zb) = ipriz) for any a(z) C A. 

The proof is by induction on |a(z)|. The base case is when a(z) = B. Let 
B = {61, . . . &,.}. By repeated application of ip(zbi) = ip(z) we find {b\ ■ ■ ■ 6 r )' M ', 
which has length \M\\B\ < \M\(\B\ 2 + \B\)/2, is a subword of z. The base 
case occurs when B — a(z). In this case we can simply take z' = z and factor 
z = z\ - ■ ■ z\m\ 1 where each z$ contains one of the factors b\ ■ ■ ■ b r as a subword. 

We thus suppose that a(z) = A' contains B as a proper subset. Let N — \A'\. 
We look at the longest subword t\ ■ ■ ■ t p of z such that a(tj) — A'. We must have 
p > 1. If p > |M|, we can again take z' = z and factor z as z\ ■ ■ ■ Zi^i, where 
each Zi contains ti as a subword. If p < \M\, we let s = \A'\, then we write 

tl " ' ' tp = a l a 2 ' ' ' a psj arL d 

z = zqo,\Z\ ■ ■ ■ a ps z ps . 

We further suppose that this factorization represents the leftmost occurrence of 
Qj\ ' ' ' dps as a subword of z, in other words that z ps has maximum possible length 
for this property. Note that a(z ps ) is a strict subset of A' , for otherwise z would 
have contained a longer subword t\ ■ ■ - t p+ i with a(ti) = A'. Thus |a(z ps )| < 
N - 1. Set T = \M\ ■ ((N - l) 2 + (N — l))/2. We must have ip T (z ps b) = ip T {z ps ) 
for all b € B. If not, there is a subword u of z ps of length less than T such that 
ub is not a subword of z ps . However ti ■ ■ ■ t p ub has length no more than 

(\M\ - 1) • N + T < \M\ ■ (N+((N- l) 2 + (N — l))/2) = \M\ ■ (N 2 +N)/2, 

and is accordingly a subword of 2, and thus there is a strictly earlier occurrence 
of ti ■ ■ ■ t p as a subword of z, a contradiction. We can thus apply the inductive 
hypothesis to z ps and conclude that z ps contains a suffix of the required type. □ 

5 Collapse of the hierarchy 

In the original model-theoretic study of the alternation hierarchy in F0 2 [<], 
Weis and Immerman ;18J showed that while the hierarchy is strict, it collapses 
for each fixed-size alphabet. An algebraic proof of strictness was given in [T5] . 
using the identities that form the subject of the present paper. Here we use 
these techniques to prove the collapse result. 

Theorem 5.1. Let n > 0. If M G DA is generated by n elements, then M € 
V„. 

In particular for any fixed alphabet the quantifier alternation hierarchy col- 
lapses. 

Corollary 5.2. Any language over a k-letter alphabet definable by a two-variable 
sentence is definable by one in which the number of quantifier blocks is k. 
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Proof of Theorem \5.1[ We prove by induction on n that if M is generated by 
n elements and M |= (un — Wjv) for some N > n, then M |= (u n = v n ). 
By Theorem 14.11 this implies the result. Every monoid with one generator is 
commutative, which gives the result for n = 1. We now let n > 1, and suppose 
M \= (ujv = vn). Let X\,..., X 2n G M. Consider the valuation that maps each 
variable Xi to Xj and let Uj ,Vj S M for j < n be the resulting valuations of the 
terms Uj , Vj . We suppose that M is generated by n elements, so that each Xi 
itself is a product of these generators. 

We consider two cases: If X\, . . . ,X 2n -2 can all be written as products of 
elements of some strict subset of these n generators, then all the Xi for i < 2n—2 
belong to an (n — l)-generated submonoid M' of M. It follows by the inductive 
hypothesis that M' |= = v n -i), and thus J7 n _i = V n -\. 

In the second case, X\ ■ ■ ■ X 2n -2 can be written as a product involving all n 
generators. To ease notation we let 

A = \X\ ■ ■ ■ Xzn-lf , 
B = (X2 n X\ ■ ■ ■ X2U-2 Y ' ■ 

Then U n = AU n -±B = AAU n -iBB, since A and B are idempotent. We 
know M G DA, and every Xi (for i = 1, . . . , 2N) is a product of generators 
appearing in X\, . . . X2 n ~2, and hence in A and B. It follows that all generators 
are in Ma and Mb- We can repeatedly used the identity e = e • M e ■ e to insert 
any product of generators — in particular, any Xj — between two occurrences 
of either of the idempotents A or B. 

If we set Xj = 1 for j = 2n + 1, . . . , 2JV, we get that 

Un = (X\ . . . X2n) UJ U n {X\ . . . X 2n ) UJ ■ 

Thus 

U n = AAUn-iBB 

= A{X 1 ...X 2n yAU n - 1 B{X l ...X 2n rB 

= A{x x . . . x 2n Yu n {x x . . . x 2n ys 

= AU N B 

Likewise V n — AVnB. Since M (= (un = Vn), w c get U n — V n . As the valuation 
on xi, . . . , x 2n was arbitrary, we have M \= (u n — v n ). □ 

6 General decidability results 

Here we show that for arbitrary pseudovarieties V, the operation V 4 V ** J 
preserve decidability. This of course implies our result (a consequence of Theo- 
rem [Li} that the varieties Vj are all decidable, but Theorem 14. II is a sharper 
result, since it gives explicit identities. As we remarked in the introduction, 
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the general decidability result was originally proved by Steinberg Q3], but not 
previously published. Our approach has the advantages both of being relatively 
elementary, and yielding explicit bounds on the complexity of membership test- 
ing. 

We suppose that <fi : A* — > M is a surjective homomorphism onto a finite 
monoid. Let N > 0, we denote by ker n cf> the category ker(ipN ° where 
ip ]v is the natural projection of A* onto the quotient A* / . We set 

# = |M|-(L4| 2 + L4|)/2 

as in the statement of Lemma 14.21 With these notations we have: 

Theorem 6.1. Let V be a pseudovariety of monoids. MeV«J if and only 
if every base monoid of ker^ <fi is in V . 

We can effectively compute all the objects and arrow classes of ker^ </> from 
4>, and we can also take A = M and <fi to be the extension of the identity map 
on M to A*. The theorem thus immediately implies 

Corollary 6.2. //V is a decidable pseudovariety of finite monoids, then so is 

V ** J. 

The remainder of the section is devoted to the proof of Theorem l6.ll Strictly 
speaking, our argument is complete only in the case where Ji C V, but see the 
remark at the end of Section |31 which implies that our proof is valid in all cases. 

Proof of Theorem \6.1\ If all the base monoids of kcix (f> are in V, then M € 

V ** J by Theorem 13. II For the converse, we suppose MeV«J. Then, again 
by Theorem l3.11 there exists K' such that every base monoid of ker^/ tfi is in V. 

If K' < K, the desired result follows directly from Lemma lemma:basemonoids. 
So we assume K' > K. We will need the special properties of the morphisms 
4>n given in Lemma T4. 2 1 Let (P, S) be an object of ker^f (p. Set 

B = {b G A : P ■ i> K {b) = P,^ K (b) ■S = S}. 

The base monoid Mp.s is generated by the arrows (P, S) \ (P, S), and conse- 
quently we obtain a homomorphism p from B* onto Mp g. 

Let z, y 6 A* be such that ^k{z) = P ^k(v) = S. By Lemma there are 
factorizations 

z = z Q zi ■ ■ ■ z\ M \,y = y\ M \ • --ym, 

where for all 1 < i,j < \M\, B C a(zi) = a(zj) and B C a(yi) — a(yj). 
We also have, for some 1 < i < j < \M\, 

<f>(Zi ■ ■ ■ Zi) = 4>{Z X ■ ■ ■ ZiZ i+ i ■ ■ ■ Zj), 

so we can insert arbitrarily many copies of Zi + ± ■ ■ ■ Zj into z (and likewise into 
y) without changing the value of the word under cf>. Let us do this in such a 
manner that the resulting words 

z' = z Q zi ■ ■ ■ z r ,y' = y r ■ ■ ■ yiy 
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contain r > K' of the factors Zk and respectively. Let P' = ipK'(z') and 
S' = ipK>{y')- It follows that P'-iP K ,{b) = P' for every b e B, for if a ■ --CK'-ib 
is a subword of z'6 that uses the final letter of z'b, then we can find an occurrence 
of ci • • ■ ck'-i contained entirely in the factors zq ■ ■ ■ z r —\ of z' , and consequently 
an occurrence of c\ ■ ■ ■ CK'-ib in z'. Likewise S' = ipK'(b) ■ S' for every b G B. It 

follows that (P' , S') —> (P' , S r ) represents an arrow of Mpi s>- 

We take all the objects (P' . S') that arise in this manner from representatives 
z,y of (P,S) and form the direct product N' of the resulting Mp>.s>- For each 
deBwe take the element of N' whose value in each component Mp/^s' is the 

arrow represented by (P' , S 1 ) \ (P 1 , S'), and we form the submonoid N of N' 
generated by these elements. We thus have a homomorphism a from B* onto 
AT. 

Let w,w' E B* with a(w) = a(w'). We claim p(w) = p(w'). Indeed, let 
z, y € A* with ipi<(z) = P and ipK(y) = S. We obtain z',y' from z,y as 
above, and set P' = i/)k'(z), S' — %jjK>{y)- (P',3') is one of the objects used 
to build the direct product N', so a(w) — a(w') implies in particular that 

(P',S') W ' W > (P',S') are equivalent arrows. Thus 

4>(zwy) = (p(z'wy') = 4>{z'w'y') = <fi(zwy). 

Thus (P, S) — -> (P, S) are equivalent arrows, so p(w) = p[w'), as claimed. 
Thus Mp.5 is a homomorphic image of N. thus a divisor of N' and consequently 
in V. This shows that all base monoids of ker^ <f> are in V, as required. □ 

7 Conclusion 

We have shown that the identities given in [15] indeed characterize V„. There is, 
or course, a one-sided semidirect product, which has been much more thoroughly 
studied. Our results, and their proofs, can all be adapted to one-sided products, 
with little modification. In this case, the hierarchy collapses at the second level: 
J*J*J = J*J. (This fact is not new. It has long been known that the closure 
of J under one-sided products is the pseudovariety R of 7?.-trivial monoids, and 
Brzozowski and Fich [3] showed R = Ji * J.) 

In their Paper Kufleitner and Weil [5] give a completely different characteri- 
zation of the levels of F0 2 [<]. It would be nice to see direct connection between 
these two approaches. 

Acknowledgements We are grateful to Manfred Kufleitner, Benjamin Stein- 
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